Automatic Consistency Checking of Table and Text in Financial Documents

نویسندگان

چکیده

A company's financial documents use tables along with text to organize the data containing key performance indicators (KPIs) (such as profit and loss) a quantity linked them. The KPI’s in table might not be equal similarly described KPI's text. Auditors take substantial time manually audit these mistakes this process is called consistency checking. As compared existing work, paper attempts automate task help of transformer-based models. Furthermore, for checking it essential table's KPIs embeddings encode semantic knowledge structural table. Therefore, proposes pipeline that uses tabular model get embeddings. takes input KPIs, generates their embeddings, then checks whether are identical. evaluated on German language comparative analysis cell embeddings' quality from three models also presented. From evaluation results, experiment used English-translated Tabbie generate KPIs’ achieved an accuracy 72.81% task, outperforming benchmark, other

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Consistency Checking of Financial Derivatives Transactions

Financial institutions are increasingly using XML as a de-facto standard to represent and exchange information about their products and services. Their aim is to process transactions quickly, cost-effectively, and with minimal human intervention. Due to the nature of the financial industry, inconsistencies inevitably appear throughout the lifetime of a financial transaction and their resolution...

متن کامل

ideological and cultural orientations in translation of narrative text: the case of hajji baba of isfahan

در میان عواملی که ممکن است ذهن مترجم را هنگام ترجمه تحت تأثیر قرار دهند، می توان به مقوله انتقال ایدئولوژی از طریق متن یا گفتمان اشاره کرد. هدف از این تحقیق تجزیه و تحلیل جنبه های ایدئولوژیکی و فرهنگی متن مبدأ انگلیسی نوشته جیمز موریه تحت عنوان سرگذشت حاجی بابای اصفهانی ( 1823) و ترجمه فارسی میرزا حبیب اصفهانی(1880) بوده است.

Extracting Financial Information from Text Documents

The majority of electronic data today is in textual form. Financial data such as articles in the Wall Street Journal are written as texts. These electronic documents contain a wealth of information but require human interpretation. For financial analysis, rapid up-to-date information is critical. Most software tools currently require data which are better structured than text (such as data in r...

متن کامل

Automatic Dimensional Consistency Checking for Simulation Specifications

Simulation specification languages usually have support for units or dimensions, but seldom use it for more then presenting simulation results. We will show that this annotation can be used to analyze the specifications and as a result eliminate dimensional errors in the specification equations and expressions. We use well known theories on dimensions and type systems to achieve a sound and com...

متن کامل

Improving the Automatic Retrieval of Text Documents

This paper reports on a statistical stemming algorithm based on link analysis. Considering that a word is formed by a prefix (stem) and a suffix, the key idea is that the interlinked prefixes and suffixes form a community of sub-strings. Thus, discovering these communities means searching for the best word splits that give the best word stems. The algorithm has been used in our participation in...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the Northern Lights Deep Learning Workshop

سال: 2023

ISSN: ['2703-6928']

DOI: https://doi.org/10.7557/18.6816